Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 3329147 |
| Missing cells | 65665372 |
| Missing cells (%) | 49.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1016.0 MiB |
| Average record size in memory | 320.0 B |
Variable types
| Categorical | 16 |
|---|---|
| Numeric | 17 |
| Unsupported | 7 |
id_mutation has a high cardinality: 1432599 distinct values | High cardinality |
date_mutation has a high cardinality: 364 distinct values | High cardinality |
adresse_nom_voie has a high cardinality: 484404 distinct values | High cardinality |
adresse_code_voie has a high cardinality: 16111 distinct values | High cardinality |
nom_commune has a high cardinality: 30535 distinct values | High cardinality |
ancien_nom_commune has a high cardinality: 591 distinct values | High cardinality |
id_parcelle has a high cardinality: 2032421 distinct values | High cardinality |
ancien_id_parcelle has a high cardinality: 10339 distinct values | High cardinality |
code_nature_culture_speciale has a high cardinality: 125 distinct values | High cardinality |
nature_culture_speciale has a high cardinality: 125 distinct values | High cardinality |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with lot5_surface_carrez and 3 other fields | High correlation |
lot2_surface_carrez is highly correlated with lot3_surface_carrez and 4 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot2_surface_carrez and 3 other fields | High correlation |
lot4_surface_carrez is highly correlated with lot2_surface_carrez and 2 other fields | High correlation |
lot5_surface_carrez is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
surface_reelle_bati is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
nombre_pieces_principales is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
surface_terrain is highly correlated with lot1_surface_carrez | High correlation |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with lot5_surface_carrez and 1 other fields | High correlation |
lot2_surface_carrez is highly correlated with lot4_surface_carrez and 1 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot4_surface_carrez and 1 other fields | High correlation |
lot4_surface_carrez is highly correlated with lot2_surface_carrez and 2 other fields | High correlation |
lot5_surface_carrez is highly correlated with lot1_surface_carrez and 4 other fields | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
surface_reelle_bati is highly correlated with lot5_surface_carrez | High correlation |
nombre_pieces_principales is highly correlated with code_type_local | High correlation |
surface_terrain is highly correlated with lot1_surface_carrez | High correlation |
code_postal is highly correlated with ancien_code_commune | High correlation |
ancien_code_commune is highly correlated with code_postal | High correlation |
lot1_surface_carrez is highly correlated with surface_reelle_bati and 1 other fields | High correlation |
lot2_surface_carrez is highly correlated with surface_reelle_bati and 1 other fields | High correlation |
lot3_surface_carrez is highly correlated with surface_reelle_bati | High correlation |
lot4_surface_carrez is highly correlated with lot5_surface_carrez | High correlation |
lot5_surface_carrez is highly correlated with lot4_surface_carrez | High correlation |
code_type_local is highly correlated with nombre_pieces_principales | High correlation |
surface_reelle_bati is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
nombre_pieces_principales is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
code_nature_culture is highly correlated with nature_culture | High correlation |
nature_culture is highly correlated with code_nature_culture | High correlation |
code_type_local is highly correlated with type_local | High correlation |
type_local is highly correlated with code_type_local | High correlation |
adresse_numero is highly correlated with adresse_suffixe | High correlation |
adresse_suffixe is highly correlated with adresse_numero and 2 other fields | High correlation |
code_postal is highly correlated with ancien_code_commune and 1 other fields | High correlation |
ancien_code_commune is highly correlated with adresse_suffixe and 1 other fields | High correlation |
lot1_surface_carrez is highly correlated with lot4_surface_carrez and 3 other fields | High correlation |
lot2_surface_carrez is highly correlated with lot3_surface_carrez and 2 other fields | High correlation |
lot3_surface_carrez is highly correlated with lot2_surface_carrez and 2 other fields | High correlation |
lot4_surface_carrez is highly correlated with adresse_suffixe and 4 other fields | High correlation |
lot5_surface_carrez is highly correlated with lot1_surface_carrez and 3 other fields | High correlation |
code_type_local is highly correlated with type_local | High correlation |
type_local is highly correlated with code_type_local | High correlation |
code_nature_culture is highly correlated with lot1_surface_carrez and 1 other fields | High correlation |
nature_culture is highly correlated with lot1_surface_carrez and 1 other fields | High correlation |
longitude is highly correlated with code_postal and 1 other fields | High correlation |
latitude is highly correlated with longitude | High correlation |
adresse_numero has 1398919 (42.0%) missing values | Missing |
adresse_suffixe has 3184623 (95.7%) missing values | Missing |
ancien_code_commune has 3275534 (98.4%) missing values | Missing |
ancien_nom_commune has 3275534 (98.4%) missing values | Missing |
ancien_id_parcelle has 3315540 (99.6%) missing values | Missing |
numero_volume has 3319162 (99.7%) missing values | Missing |
lot1_numero has 2294729 (68.9%) missing values | Missing |
lot1_surface_carrez has 3039921 (91.3%) missing values | Missing |
lot2_numero has 3111925 (93.5%) missing values | Missing |
lot2_surface_carrez has 3258320 (97.9%) missing values | Missing |
lot3_numero has 3292644 (98.9%) missing values | Missing |
lot3_surface_carrez has 3322123 (99.8%) missing values | Missing |
lot4_numero has 3316424 (99.6%) missing values | Missing |
lot4_surface_carrez has 3327276 (99.9%) missing values | Missing |
lot5_numero has 3323135 (99.8%) missing values | Missing |
lot5_surface_carrez has 3328380 (> 99.9%) missing values | Missing |
code_type_local has 1512780 (45.4%) missing values | Missing |
type_local has 1512780 (45.4%) missing values | Missing |
surface_reelle_bati has 1970649 (59.2%) missing values | Missing |
nombre_pieces_principales has 1515599 (45.5%) missing values | Missing |
code_nature_culture has 1050173 (31.5%) missing values | Missing |
nature_culture has 1050173 (31.5%) missing values | Missing |
code_nature_culture_speciale has 3175092 (95.4%) missing values | Missing |
nature_culture_speciale has 3175092 (95.4%) missing values | Missing |
surface_terrain has 1050235 (31.5%) missing values | Missing |
longitude has 72660 (2.2%) missing values | Missing |
latitude has 72660 (2.2%) missing values | Missing |
numero_disposition is highly skewed (γ1 = 42.34108297) | Skewed |
valeur_fonciere is highly skewed (γ1 = 85.45130559) | Skewed |
lot1_surface_carrez is highly skewed (γ1 = 42.78337816) | Skewed |
lot2_surface_carrez is highly skewed (γ1 = 62.02338371) | Skewed |
lot3_surface_carrez is highly skewed (γ1 = 20.28587031) | Skewed |
lot4_numero is highly skewed (γ1 = 58.24379501) | Skewed |
nombre_lots is highly skewed (γ1 = 46.98893099) | Skewed |
surface_reelle_bati is highly skewed (γ1 = 134.4592681) | Skewed |
surface_terrain is highly skewed (γ1 = 82.35419667) | Skewed |
code_commune is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
code_departement is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
numero_volume is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot1_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot2_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot3_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lot5_numero is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
nombre_lots has 2294729 (68.9%) zeros | Zeros |
nombre_pieces_principales has 581298 (17.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-10-05 23:06:37.958047 |
|---|---|
| Analysis finished | 2021-10-05 23:22:15.054046 |
| Duration | 15 minutes and 37.1 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 1432599 |
|---|---|
| Distinct (%) | 43.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.4 MiB |
| 2018-1305371 | 3844 |
|---|---|
| 2018-20073 | 2097 |
| 2018-1361763 | 2092 |
| 2018-413181 | 1964 |
| 2018-1103729 | 1489 |
| Other values (1432594) |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 11.19680357 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 690859 ? |
|---|---|
| Unique (%) | 20.8% |
Sample
| 1st row | 2018-1 |
|---|---|
| 2nd row | 2018-1 |
| 3rd row | 2018-2 |
| 4th row | 2018-2 |
| 5th row | 2018-2 |
Common Values
| Value | Count | Frequency (%) |
| 2018-1305371 | 3844 | 0.1% |
| 2018-20073 | 2097 | 0.1% |
| 2018-1361763 | 2092 | 0.1% |
| 2018-413181 | 1964 | 0.1% |
| 2018-1103729 | 1489 | < 0.1% |
| 2018-1371000 | 1221 | < 0.1% |
| 2018-586909 | 1093 | < 0.1% |
| 2018-586745 | 1088 | < 0.1% |
| 2018-805382 | 1039 | < 0.1% |
| 2018-1262363 | 1010 | < 0.1% |
| Other values (1432589) | 3312210 |
Length
| Value | Count | Frequency (%) |
| 2018-1305371 | 3844 | 0.1% |
| 2018-20073 | 2097 | 0.1% |
| 2018-1361763 | 2092 | 0.1% |
| 2018-413181 | 1964 | 0.1% |
| 2018-1103729 | 1489 | < 0.1% |
| 2018-1371000 | 1221 | < 0.1% |
| 2018-586909 | 1093 | < 0.1% |
| 2018-586745 | 1088 | < 0.1% |
| 2018-805382 | 1039 | < 0.1% |
| 2018-1262363 | 1010 | < 0.1% |
| Other values (1432589) | 3312210 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 364 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.4 MiB |
| 2018-12-21 | 42118 |
|---|---|
| 2018-12-28 | 38398 |
| 2018-12-20 | 31809 |
| 2018-12-27 | 30771 |
| 2018-06-29 | 30577 |
| Other values (359) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2018-01-03 |
|---|---|
| 2nd row | 2018-01-03 |
| 3rd row | 2018-01-04 |
| 4th row | 2018-01-04 |
| 5th row | 2018-01-04 |
Common Values
| Value | Count | Frequency (%) |
| 2018-12-21 | 42118 | 1.3% |
| 2018-12-28 | 38398 | 1.2% |
| 2018-12-20 | 31809 | 1.0% |
| 2018-12-27 | 30771 | 0.9% |
| 2018-06-29 | 30577 | 0.9% |
| 2018-12-14 | 25708 | 0.8% |
| 2018-09-28 | 24835 | 0.7% |
| 2018-11-30 | 24810 | 0.7% |
| 2018-04-27 | 23991 | 0.7% |
| 2018-12-19 | 23450 | 0.7% |
| Other values (354) | 3032680 |
Length
| Value | Count | Frequency (%) |
| 2018-12-21 | 42118 | 1.3% |
| 2018-12-28 | 38398 | 1.2% |
| 2018-12-20 | 31809 | 1.0% |
| 2018-12-27 | 30771 | 0.9% |
| 2018-06-29 | 30577 | 0.9% |
| 2018-12-14 | 25708 | 0.8% |
| 2018-09-28 | 24835 | 0.7% |
| 2018-11-30 | 24810 | 0.7% |
| 2018-04-27 | 23991 | 0.7% |
| 2018-12-19 | 23450 | 0.7% |
| Other values (354) | 3032680 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 362 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.22082173 |
| Minimum | 1 |
|---|---|
| Maximum | 362 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 362 |
| Range | 361 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.983262757 |
|---|---|
| Coefficient of variation (CV) | 4.081892249 |
| Kurtosis | 1994.248059 |
| Mean | 1.22082173 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 42.34108297 |
| Sum | 4064295 |
| Variance | 24.8329077 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3115008 | |
| 2 | 175601 | 5.3% |
| 3 | 24859 | 0.7% |
| 4 | 4792 | 0.1% |
| 5 | 2214 | 0.1% |
| 6 | 684 | < 0.1% |
| 7 | 419 | < 0.1% |
| 8 | 414 | < 0.1% |
| 9 | 222 | < 0.1% |
| 12 | 182 | < 0.1% |
| Other values (352) | 4752 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 3115008 | |
| 2 | 175601 | 5.3% |
| 3 | 24859 | 0.7% |
| 4 | 4792 | 0.1% |
| 5 | 2214 | 0.1% |
| 6 | 684 | < 0.1% |
| 7 | 419 | < 0.1% |
| 8 | 414 | < 0.1% |
| 9 | 222 | < 0.1% |
| 10 | 169 | < 0.1% |
| Value | Count | Frequency (%) |
| 362 | 1 | |
| 361 | 1 | |
| 360 | 1 | |
| 359 | 2 | |
| 358 | 1 | |
| 357 | 1 | |
| 356 | 1 | |
| 355 | 1 | |
| 354 | 1 | |
| 353 | 1 |
nature_mutation
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.4 MiB |
| Vente | |
|---|---|
| Vente en l'état futur d'achèvement | 256130 |
| Echange | 47457 |
| Vente terrain à bâtir | 13758 |
| Adjudication | 12900 |
Length
| Max length | 34 |
|---|---|
| Median length | 5 |
| Mean length | 7.358908153 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Vente |
|---|---|
| 2nd row | Vente |
| 3rd row | Vente |
| 4th row | Vente |
| 5th row | Vente |
Common Values
| Value | Count | Frequency (%) |
| Vente | 2996397 | |
| Vente en l'état futur d'achèvement | 256130 | 7.7% |
| Echange | 47457 | 1.4% |
| Vente terrain à bâtir | 13758 | 0.4% |
| Adjudication | 12900 | 0.4% |
| Expropriation | 2505 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| vente | 3266285 | |
| d'achèvement | 256130 | 5.8% |
| futur | 256130 | 5.8% |
| l'état | 256130 | 5.8% |
| en | 256130 | 5.8% |
| echange | 47457 | 1.1% |
| bâtir | 13758 | 0.3% |
| à | 13758 | 0.3% |
| terrain | 13758 | 0.3% |
| adjudication | 12900 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 132875 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 31915 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 800188.1149 |
| Minimum | 0.13 |
|---|---|
| Maximum | 1256965630 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 0.13 |
|---|---|
| 5-th percentile | 2370 |
| Q1 | 56000 |
| median | 143700 |
| Q3 | 260000 |
| 95-th percentile | 1245000 |
| Maximum | 1256965630 |
| Range | 1256965630 |
| Interquartile range (IQR) | 204000 |
Descriptive statistics
| Standard deviation | 12078005.65 |
|---|---|
| Coefficient of variation (CV) | 15.09395781 |
| Kurtosis | 8596.509286 |
| Mean | 800188.1149 |
| Median Absolute Deviation (MAD) | 96688 |
| Skewness | 85.45130559 |
| Sum | 2.638405859 × 1012 |
| Variance | 1.458782204 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100000 | 28692 | 0.9% |
| 150000 | 27861 | 0.8% |
| 1 | 27224 | 0.8% |
| 120000 | 27110 | 0.8% |
| 80000 | 24033 | 0.7% |
| 50000 | 23887 | 0.7% |
| 110000 | 22895 | 0.7% |
| 90000 | 22884 | 0.7% |
| 200000 | 22804 | 0.7% |
| 130000 | 22384 | 0.7% |
| Other values (132865) | 3047458 | |
| (Missing) | 31915 | 1.0% |
| Value | Count | Frequency (%) |
| 0.13 | 2 | < 0.1% |
| 0.15 | 118 | |
| 0.16 | 5 | < 0.1% |
| 0.17 | 2 | < 0.1% |
| 0.18 | 50 | |
| 0.19 | 1 | < 0.1% |
| 0.2 | 2 | < 0.1% |
| 0.25 | 2 | < 0.1% |
| 0.27 | 2 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1256965630 | 40 | < 0.1% |
| 1249132030 | 205 | |
| 629591420 | 3 | < 0.1% |
| 598015740 | 1 | < 0.1% |
| 477094080 | 12 | < 0.1% |
| 458865760 | 47 | < 0.1% |
| 421000000 | 22 | < 0.1% |
| 362693728 | 1 | < 0.1% |
| 318688800 | 34 | < 0.1% |
| 310600000 | 79 | < 0.1% |
| Distinct | 7201 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1398919 |
| Missing (%) | 42.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 788.0530373 |
| Minimum | 1 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 26 |
| Q3 | 102 |
| 95-th percentile | 5754 |
| Maximum | 9999 |
| Range | 9998 |
| Interquartile range (IQR) | 94 |
Descriptive statistics
| Standard deviation | 2132.927192 |
|---|---|
| Coefficient of variation (CV) | 2.706578227 |
| Kurtosis | 7.165713917 |
| Mean | 788.0530373 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 2.885265279 |
| Sum | 1521122038 |
| Variance | 4549378.407 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 81857 | 2.5% |
| 2 | 76666 | 2.3% |
| 3 | 61201 | 1.8% |
| 4 | 59856 | 1.8% |
| 5 | 56079 | 1.7% |
| 6 | 53685 | 1.6% |
| 7 | 47644 | 1.4% |
| 8 | 47113 | 1.4% |
| 10 | 43480 | 1.3% |
| 9 | 40927 | 1.2% |
| Other values (7191) | 1361720 | |
| (Missing) | 1398919 |
| Value | Count | Frequency (%) |
| 1 | 81857 | |
| 2 | 76666 | |
| 3 | 61201 | |
| 4 | 59856 | |
| 5 | 56079 | |
| 6 | 53685 | |
| 7 | 47644 | |
| 8 | 47113 | |
| 9 | 40927 | |
| 10 | 43480 |
| Value | Count | Frequency (%) |
| 9999 | 345 | |
| 9998 | 55 | < 0.1% |
| 9997 | 24 | < 0.1% |
| 9996 | 10 | < 0.1% |
| 9995 | 12 | < 0.1% |
| 9994 | 22 | < 0.1% |
| 9993 | 2 | < 0.1% |
| 9992 | 2 | < 0.1% |
| 9991 | 15 | < 0.1% |
| 9990 | 24 | < 0.1% |
| Distinct | 41 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3184623 |
| Missing (%) | 95.7% |
| Memory size | 25.4 MiB |
| B | |
|---|---|
| A | |
| F | |
| T | |
| C | 4497 |
| Other values (36) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Z |
|---|---|
| 2nd row | Z |
| 3rd row | B |
| 4th row | A |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
| B | 83313 | 2.5% |
| A | 22573 | 0.7% |
| F | 14094 | 0.4% |
| T | 11309 | 0.3% |
| C | 4497 | 0.1% |
| D | 2167 | 0.1% |
| E | 1201 | < 0.1% |
| Q | 1060 | < 0.1% |
| P | 754 | < 0.1% |
| G | 557 | < 0.1% |
| Other values (31) | 2999 | 0.1% |
| (Missing) | 3184623 |
Length
| Value | Count | Frequency (%) |
| b | 83313 | |
| a | 22573 | 15.6% |
| f | 14094 | 9.8% |
| t | 11309 | 7.8% |
| c | 4497 | 3.1% |
| d | 2167 | 1.5% |
| e | 1201 | 0.8% |
| q | 1060 | 0.7% |
| p | 754 | 0.5% |
| g | 557 | 0.4% |
| Other values (27) | 2999 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 484404 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 30430 |
| Missing (%) | 0.9% |
| Memory size | 25.4 MiB |
| LE VILLAGE | 31345 |
|---|---|
| LE BOURG | 26496 |
| RUE JEAN JAURES | 6750 |
| GR GRANDE RUE | 6140 |
| AV JEAN JAURES | 6096 |
| Other values (484399) |
Length
| Max length | 31 |
|---|---|
| Median length | 14 |
| Mean length | 14.65257462 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 165688 ? |
|---|---|
| Unique (%) | 5.0% |
Sample
| 1st row | RUE GEN LOGEROT |
|---|---|
| 2nd row | RUE GEN LOGEROT |
| 3rd row | RUE DE LA BARMETTE |
| 4th row | RUE DE LA BARMETTE |
| 5th row | RUE DE LA BARMETTE |
Common Values
| Value | Count | Frequency (%) |
| LE VILLAGE | 31345 | 0.9% |
| LE BOURG | 26496 | 0.8% |
| RUE JEAN JAURES | 6750 | 0.2% |
| GR GRANDE RUE | 6140 | 0.2% |
| AV JEAN JAURES | 6096 | 0.2% |
| RUE DE LA REPUBLIQUE | 5877 | 0.2% |
| RUE PASTEUR | 5248 | 0.2% |
| AV DE LA REPUBLIQUE | 5008 | 0.2% |
| RUE VICTOR HUGO | 4971 | 0.1% |
| RUE DE PARIS | 4217 | 0.1% |
| Other values (484394) | 3196569 | |
| (Missing) | 30430 | 0.9% |
Length
| Value | Count | Frequency (%) |
| rue | 1086845 | 11.5% |
| de | 715473 | 7.6% |
| la | 488261 | 5.2% |
| du | 329727 | 3.5% |
| le | 288874 | 3.1% |
| des | 278546 | 2.9% |
| av | 254732 | 2.7% |
| les | 230910 | 2.4% |
| che | 104599 | 1.1% |
| rte | 99464 | 1.1% |
| Other values (202213) | 5571404 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 16111 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 30389 |
| Missing (%) | 0.9% |
| Memory size | 25.4 MiB |
| B005 | 15659 |
|---|---|
| B004 | 15081 |
| B009 | 15002 |
| B002 | 14654 |
| B003 | 14515 |
| Other values (16106) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1383 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1660 |
|---|---|
| 2nd row | 1660 |
| 3rd row | 0025 |
| 4th row | 0025 |
| 5th row | 0025 |
Common Values
| Value | Count | Frequency (%) |
| B005 | 15659 | 0.5% |
| B004 | 15081 | 0.5% |
| B009 | 15002 | 0.5% |
| B002 | 14654 | 0.4% |
| B003 | 14515 | 0.4% |
| B011 | 14508 | 0.4% |
| B008 | 14472 | 0.4% |
| B014 | 14354 | 0.4% |
| B013 | 14298 | 0.4% |
| B006 | 14271 | 0.4% |
| Other values (16101) | 3151944 | |
| (Missing) | 30389 | 0.9% |
Length
| Value | Count | Frequency (%) |
| b005 | 15659 | 0.5% |
| b004 | 15081 | 0.5% |
| b009 | 15002 | 0.5% |
| b002 | 14654 | 0.4% |
| b003 | 14515 | 0.4% |
| b011 | 14508 | 0.4% |
| b008 | 14472 | 0.4% |
| b014 | 14354 | 0.4% |
| b013 | 14298 | 0.4% |
| b006 | 14271 | 0.4% |
| Other values (16101) | 3151944 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 5865 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 30556 |
| Missing (%) | 0.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50433.12658 |
| Minimum | 1000 |
|---|---|
| Maximum | 97490 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 6600 |
| Q1 | 29300 |
| median | 49240 |
| Q3 | 75014 |
| 95-th percentile | 93160 |
| Maximum | 97490 |
| Range | 96490 |
| Interquartile range (IQR) | 45714 |
Descriptive statistics
| Standard deviation | 27431.69274 |
|---|---|
| Coefficient of variation (CV) | 0.5439221123 |
| Kurtosis | -1.196371386 |
| Mean | 50433.12658 |
| Median Absolute Deviation (MAD) | 23940 |
| Skewness | -0.004125740497 |
| Sum | 1.663582575 × 1011 |
| Variance | 752497766.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75015 | 8066 | 0.2% |
| 69100 | 7890 | 0.2% |
| 21000 | 7417 | 0.2% |
| 31200 | 7403 | 0.2% |
| 35000 | 7308 | 0.2% |
| 54000 | 6509 | 0.2% |
| 51100 | 6480 | 0.2% |
| 75016 | 6477 | 0.2% |
| 75018 | 6232 | 0.2% |
| 14000 | 6077 | 0.2% |
| Other values (5855) | 3228732 | |
| (Missing) | 30556 | 0.9% |
| Value | Count | Frequency (%) |
| 1000 | 1743 | |
| 1090 | 347 | < 0.1% |
| 1100 | 1241 | |
| 1110 | 608 | < 0.1% |
| 1120 | 650 | < 0.1% |
| 1130 | 356 | < 0.1% |
| 1140 | 593 | < 0.1% |
| 1150 | 1118 | |
| 1160 | 717 | < 0.1% |
| 1170 | 1809 |
| Value | Count | Frequency (%) |
| 97490 | 1649 | |
| 97480 | 594 | < 0.1% |
| 97470 | 304 | < 0.1% |
| 97460 | 630 | < 0.1% |
| 97450 | 187 | < 0.1% |
| 97442 | 117 | < 0.1% |
| 97441 | 257 | < 0.1% |
| 97440 | 703 | |
| 97439 | 114 | < 0.1% |
| 97438 | 955 |
| Distinct | 30535 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.4 MiB |
| Toulouse | 27033 |
|---|---|
| Nice | 16665 |
| Nantes | 16149 |
| Montpellier | 15364 |
| Bordeaux | 14567 |
| Other values (30530) |
Length
| Max length | 45 |
|---|---|
| Median length | 10 |
| Mean length | 11.88880395 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 388 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Bourg-en-Bresse |
|---|---|
| 2nd row | Bourg-en-Bresse |
| 3rd row | Nivigne et Suran |
| 4th row | Nivigne et Suran |
| 5th row | Nivigne et Suran |
Common Values
| Value | Count | Frequency (%) |
| Toulouse | 27033 | 0.8% |
| Nice | 16665 | 0.5% |
| Nantes | 16149 | 0.5% |
| Montpellier | 15364 | 0.5% |
| Bordeaux | 14567 | 0.4% |
| Lille | 12736 | 0.4% |
| Rennes | 12102 | 0.4% |
| Saint-Étienne | 8794 | 0.3% |
| Paris 15e Arrondissement | 8152 | 0.2% |
| Villeurbanne | 8023 | 0.2% |
| Other values (30525) | 3189562 |
Length
| Value | Count | Frequency (%) |
| arrondissement | 121016 | 3.1% |
| la | 101164 | 2.6% |
| le | 95338 | 2.5% |
| paris | 68131 | 1.8% |
| les | 34551 | 0.9% |
| marseille | 31056 | 0.8% |
| toulouse | 27033 | 0.7% |
| lyon | 21829 | 0.6% |
| nice | 16665 | 0.4% |
| nantes | 16149 | 0.4% |
| Other values (30440) | 3332359 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
ancien_code_commune
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 592 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 3275534 |
| Missing (%) | 98.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57625.90385 |
| Minimum | 1033 |
|---|---|
| Maximum | 95306 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 1033 |
|---|---|
| 5-th percentile | 11251 |
| Q1 | 35184 |
| median | 61228 |
| Q3 | 85048 |
| 95-th percentile | 93070 |
| Maximum | 95306 |
| Range | 94273 |
| Interquartile range (IQR) | 49864 |
Descriptive statistics
| Standard deviation | 27652.05033 |
|---|---|
| Coefficient of variation (CV) | 0.4798545183 |
| Kurtosis | -1.133457986 |
| Mean | 57625.90385 |
| Median Absolute Deviation (MAD) | 23966 |
| Skewness | -0.401882782 |
| Sum | 3089497583 |
| Variance | 764635887.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 85194 | 3433 | 0.1% |
| 91228 | 2148 | 0.1% |
| 93070 | 1904 | 0.1% |
| 78551 | 1488 | < 0.1% |
| 73257 | 860 | < 0.1% |
| 95306 | 834 | < 0.1% |
| 91182 | 830 | < 0.1% |
| 85146 | 769 | < 0.1% |
| 16192 | 716 | < 0.1% |
| 22093 | 678 | < 0.1% |
| Other values (582) | 39953 | 1.2% |
| (Missing) | 3275534 |
| Value | Count | Frequency (%) |
| 1033 | 481 | |
| 1036 | 68 | < 0.1% |
| 1059 | 5 | < 0.1% |
| 1091 | 276 | |
| 1097 | 22 | < 0.1% |
| 1122 | 62 | < 0.1% |
| 1130 | 66 | < 0.1% |
| 1154 | 22 | < 0.1% |
| 1185 | 278 | |
| 1186 | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 95306 | 834 | < 0.1% |
| 93070 | 1904 | |
| 91390 | 166 | < 0.1% |
| 91228 | 2148 | |
| 91222 | 4 | < 0.1% |
| 91182 | 830 | < 0.1% |
| 90073 | 48 | < 0.1% |
| 90068 | 27 | < 0.1% |
| 89448 | 10 | < 0.1% |
| 89421 | 8 | < 0.1% |
| Distinct | 591 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 3275534 |
| Missing (%) | 98.4% |
| Memory size | 25.4 MiB |
| Les Sables-d'Olonne | 3433 |
|---|---|
| Évry | 2148 |
| Saint-Ouen | 1904 |
| Saint-Germain-en-Laye | 1488 |
| Les Belleville | 860 |
| Other values (586) |
Length
| Max length | 30 |
|---|---|
| Median length | 11 |
| Mean length | 12.84334023 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Cras-sur-Reyssouze |
|---|---|
| 2nd row | Cras-sur-Reyssouze |
| 3rd row | Cras-sur-Reyssouze |
| 4th row | Cras-sur-Reyssouze |
| 5th row | Cras-sur-Reyssouze |
Common Values
| Value | Count | Frequency (%) |
| Les Sables-d'Olonne | 3433 | 0.1% |
| Évry | 2148 | 0.1% |
| Saint-Ouen | 1904 | 0.1% |
| Saint-Germain-en-Laye | 1488 | < 0.1% |
| Les Belleville | 860 | < 0.1% |
| Herblay | 834 | < 0.1% |
| Courcouronnes | 830 | < 0.1% |
| Montaigu | 769 | < 0.1% |
| Roumazières-Loubert | 716 | < 0.1% |
| Lamballe | 678 | < 0.1% |
| Other values (581) | 39953 | 1.2% |
| (Missing) | 3275534 |
Length
| Value | Count | Frequency (%) |
| les | 5928 | 9.0% |
| sables-d'olonne | 3433 | 5.2% |
| évry | 2148 | 3.3% |
| saint-ouen | 1904 | 2.9% |
| le | 1576 | 2.4% |
| saint-germain-en-laye | 1488 | 2.3% |
| la | 1486 | 2.3% |
| belleville | 1455 | 2.2% |
| herblay | 834 | 1.3% |
| courcouronnes | 830 | 1.3% |
| Other values (599) | 44855 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2032421 |
|---|---|
| Distinct (%) | 61.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.4 MiB |
| 31462000AL0070 | 1158 |
|---|---|
| 91570000AN0126 | 1051 |
| 95280000AB0305 | 1004 |
| 95018000BP0350 | 834 |
| 930630000T0252 | 796 |
| Other values (2032416) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1621119 ? |
|---|---|
| Unique (%) | 48.7% |
Sample
| 1st row | 01053000AN0073 |
|---|---|
| 2nd row | 01053000AN0073 |
| 3rd row | 01095000AH0186 |
| 4th row | 01095000AH0186 |
| 5th row | 01095000AH0186 |
Common Values
| Value | Count | Frequency (%) |
| 31462000AL0070 | 1158 | < 0.1% |
| 91570000AN0126 | 1051 | < 0.1% |
| 95280000AB0305 | 1004 | < 0.1% |
| 95018000BP0350 | 834 | < 0.1% |
| 930630000T0252 | 796 | < 0.1% |
| 78586000AE0360 | 763 | < 0.1% |
| 33333000AR0002 | 714 | < 0.1% |
| 930100000P0163 | 677 | < 0.1% |
| 47001000AR0685 | 674 | < 0.1% |
| 42218000IN0024 | 652 | < 0.1% |
| Other values (2032411) | 3320824 |
Length
| Value | Count | Frequency (%) |
| 31462000al0070 | 1158 | < 0.1% |
| 91570000an0126 | 1051 | < 0.1% |
| 95280000ab0305 | 1004 | < 0.1% |
| 95018000bp0350 | 834 | < 0.1% |
| 930630000t0252 | 796 | < 0.1% |
| 78586000ae0360 | 763 | < 0.1% |
| 33333000ar0002 | 714 | < 0.1% |
| 930100000p0163 | 677 | < 0.1% |
| 47001000ar0685 | 674 | < 0.1% |
| 42218000in0024 | 652 | < 0.1% |
| Other values (2032411) | 3320824 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 10339 |
|---|---|
| Distinct (%) | 76.0% |
| Missing | 3315540 |
| Missing (%) | 99.6% |
| Memory size | 25.4 MiB |
| 78524000AD0005 | 488 |
|---|---|
| 91182000AP0062 | 165 |
| 91182000AP0061 | 154 |
| 91182000AN0020 | 88 |
| 91182000AB0141 | 62 |
| Other values (10334) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 8924 ? |
|---|---|
| Unique (%) | 65.6% |
Sample
| 1st row | 011540000A0391 |
|---|---|
| 2nd row | 01154000ZC0027 |
| 3rd row | 01154000AA0344 |
| 4th row | 01154000AA0346 |
| 5th row | 01154000AA0348 |
Common Values
| Value | Count | Frequency (%) |
| 78524000AD0005 | 488 | < 0.1% |
| 91182000AP0062 | 165 | < 0.1% |
| 91182000AP0061 | 154 | < 0.1% |
| 91182000AN0020 | 88 | < 0.1% |
| 91182000AB0141 | 62 | < 0.1% |
| 782510000B0256 | 48 | < 0.1% |
| 78524000AB0101 | 34 | < 0.1% |
| 91182000AP0089 | 31 | < 0.1% |
| 91182000AN0528 | 30 | < 0.1% |
| 91182000AN0007 | 24 | < 0.1% |
| Other values (10329) | 12483 | 0.4% |
| (Missing) | 3315540 |
Length
| Value | Count | Frequency (%) |
| 78524000ad0005 | 488 | 3.6% |
| 91182000ap0062 | 165 | 1.2% |
| 91182000ap0061 | 154 | 1.1% |
| 91182000an0020 | 88 | 0.6% |
| 91182000ab0141 | 62 | 0.5% |
| 782510000b0256 | 48 | 0.4% |
| 78524000ab0101 | 34 | 0.2% |
| 91182000ap0089 | 31 | 0.2% |
| 91182000an0528 | 30 | 0.2% |
| 85166000ax0099 | 24 | 0.2% |
| Other values (10329) | 12483 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
lot1_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWED| Distinct | 18727 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 3039921 |
| Missing (%) | 91.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 63.34290009 |
| Minimum | 0.1 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 16.06 |
| Q1 | 33.6125 |
| median | 53.39 |
| Q3 | 73.63 |
| 95-th percentile | 118.81 |
| Maximum | 9999 |
| Range | 9998.9 |
| Interquartile range (IQR) | 40.0175 |
Descriptive statistics
| Standard deviation | 133.178849 |
|---|---|
| Coefficient of variation (CV) | 2.10250634 |
| Kurtosis | 2506.866421 |
| Mean | 63.34290009 |
| Median Absolute Deviation (MAD) | 19.98 |
| Skewness | 42.78337816 |
| Sum | 18320413.62 |
| Variance | 17736.60583 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.5 | 950 | < 0.1% |
| 12 | 667 | < 0.1% |
| 15 | 498 | < 0.1% |
| 13 | 426 | < 0.1% |
| 10 | 377 | < 0.1% |
| 65 | 360 | < 0.1% |
| 60 | 353 | < 0.1% |
| 40 | 349 | < 0.1% |
| 14 | 345 | < 0.1% |
| 20 | 335 | < 0.1% |
| Other values (18717) | 284566 | 8.5% |
| (Missing) | 3039921 |
| Value | Count | Frequency (%) |
| 0.1 | 1 | |
| 0.28 | 1 | |
| 0.36 | 1 | |
| 0.51 | 1 | |
| 0.55 | 1 | |
| 0.6 | 1 | |
| 0.65 | 1 | |
| 0.7 | 1 | |
| 0.73 | 1 | |
| 0.8 | 2 |
| Value | Count | Frequency (%) |
| 9999 | 12 | |
| 9901 | 1 | < 0.1% |
| 9654 | 1 | < 0.1% |
| 9461.5 | 1 | < 0.1% |
| 9427 | 1 | < 0.1% |
| 9269.2 | 1 | < 0.1% |
| 7800 | 5 | |
| 7418 | 1 | < 0.1% |
| 7257 | 1 | < 0.1% |
| 7119 | 1 | < 0.1% |
lot2_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWED| Distinct | 12460 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 3258320 |
| Missing (%) | 97.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.16103167 |
| Minimum | 0.1 |
|---|---|
| Maximum | 8284 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 22.75 |
| Q1 | 43 |
| median | 61.3 |
| Q3 | 76.74 |
| 95-th percentile | 111.744 |
| Maximum | 8284 |
| Range | 8283.9 |
| Interquartile range (IQR) | 33.74 |
Descriptive statistics
| Standard deviation | 60.48957233 |
|---|---|
| Coefficient of variation (CV) | 0.9427774267 |
| Kurtosis | 7123.291284 |
| Mean | 64.16103167 |
| Median Absolute Deviation (MAD) | 16.92 |
| Skewness | 62.02338371 |
| Sum | 4544333.39 |
| Variance | 3658.988361 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65 | 84 | < 0.1% |
| 70 | 79 | < 0.1% |
| 60 | 68 | < 0.1% |
| 40 | 67 | < 0.1% |
| 50 | 64 | < 0.1% |
| 30 | 64 | < 0.1% |
| 67 | 62 | < 0.1% |
| 64 | 59 | < 0.1% |
| 68 | 59 | < 0.1% |
| 62 | 59 | < 0.1% |
| Other values (12450) | 70162 | 2.1% |
| (Missing) | 3258320 |
| Value | Count | Frequency (%) |
| 0.1 | 1 | |
| 0.35 | 1 | |
| 0.56 | 1 | |
| 0.6 | 1 | |
| 0.7 | 1 | |
| 0.75 | 1 | |
| 0.8 | 1 | |
| 0.85 | 1 | |
| 0.9 | 1 | |
| 0.94 | 1 |
| Value | Count | Frequency (%) |
| 8284 | 1 | |
| 6712 | 1 | |
| 2953 | 1 | |
| 2687.5 | 2 | |
| 1894.23 | 1 | |
| 1752.4 | 1 | |
| 1723.33 | 1 | |
| 1670.7 | 1 | |
| 1661.79 | 1 | |
| 1379 | 1 |
lot3_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWED| Distinct | 4932 |
|---|---|
| Distinct (%) | 70.2% |
| Missing | 3322123 |
| Missing (%) | 99.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.62386247 |
| Minimum | 0.4 |
|---|---|
| Maximum | 8284 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 0.4 |
|---|---|
| 5-th percentile | 12.806 |
| Q1 | 38.895 |
| median | 61.775 |
| Q3 | 88.4625 |
| 95-th percentile | 170.0885 |
| Maximum | 8284 |
| Range | 8283.6 |
| Interquartile range (IQR) | 49.5675 |
Descriptive statistics
| Standard deviation | 235.1677423 |
|---|---|
| Coefficient of variation (CV) | 2.778976703 |
| Kurtosis | 493.6550396 |
| Mean | 84.62386247 |
| Median Absolute Deviation (MAD) | 24.475 |
| Skewness | 20.28587031 |
| Sum | 594398.01 |
| Variance | 55303.86702 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11.29 | 34 | < 0.1% |
| 12.5 | 31 | < 0.1% |
| 12 | 16 | < 0.1% |
| 70 | 14 | < 0.1% |
| 15 | 14 | < 0.1% |
| 21.57 | 13 | < 0.1% |
| 57.8 | 13 | < 0.1% |
| 40.02 | 13 | < 0.1% |
| 10 | 12 | < 0.1% |
| 27.58 | 12 | < 0.1% |
| Other values (4922) | 6852 | 0.2% |
| (Missing) | 3322123 |
| Value | Count | Frequency (%) |
| 0.4 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.53 | 1 | < 0.1% |
| 0.87 | 1 | < 0.1% |
| 1 | 4 | |
| 1.15 | 1 | < 0.1% |
| 1.18 | 1 | < 0.1% |
| 1.25 | 2 | |
| 1.29 | 1 | < 0.1% |
| 1.3 | 2 |
| Value | Count | Frequency (%) |
| 8284 | 1 | < 0.1% |
| 6800 | 1 | < 0.1% |
| 4503 | 1 | < 0.1% |
| 4331.4 | 11 | |
| 2692.3 | 2 | < 0.1% |
| 2159.87 | 1 | < 0.1% |
| 2104 | 1 | < 0.1% |
| 1894.23 | 1 | < 0.1% |
| 1209 | 1 | < 0.1% |
| 1163.34 | 1 | < 0.1% |
| Distinct | 792 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 3316424 |
| Missing (%) | 99.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 157.1274071 |
| Minimum | 2 |
|---|---|
| Maximum | 161313 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 8 |
| median | 24 |
| Q3 | 69 |
| 95-th percentile | 372 |
| Maximum | 161313 |
| Range | 161311 |
| Interquartile range (IQR) | 61 |
Descriptive statistics
| Standard deviation | 2171.604573 |
|---|---|
| Coefficient of variation (CV) | 13.82066066 |
| Kurtosis | 3816.304465 |
| Mean | 157.1274071 |
| Median Absolute Deviation (MAD) | 19 |
| Skewness | 58.24379501 |
| Sum | 1999132 |
| Variance | 4715866.42 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 778 | < 0.1% |
| 8 | 730 | < 0.1% |
| 7 | 705 | < 0.1% |
| 6 | 661 | < 0.1% |
| 4 | 649 | < 0.1% |
| 5 | 586 | < 0.1% |
| 3 | 343 | < 0.1% |
| 13 | 221 | < 0.1% |
| 2 | 220 | < 0.1% |
| 15 | 153 | < 0.1% |
| Other values (782) | 7677 | 0.2% |
| (Missing) | 3316424 |
| Value | Count | Frequency (%) |
| 2 | 220 | < 0.1% |
| 3 | 343 | |
| 4 | 649 | |
| 5 | 586 | |
| 6 | 661 | |
| 7 | 705 | |
| 8 | 730 | |
| 9 | 778 | |
| 11 | 6 | < 0.1% |
| 12 | 119 | < 0.1% |
| Value | Count | Frequency (%) |
| 161313 | 1 | |
| 131313 | 1 | |
| 102093 | 1 | |
| 32004 | 1 | |
| 20084 | 1 | |
| 16094 | 1 | |
| 13228 | 1 | |
| 13074 | 1 | |
| 13037 | 1 | |
| 13008 | 1 |
lot4_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 1564 |
|---|---|
| Distinct (%) | 83.6% |
| Missing | 3327276 |
| Missing (%) | 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 115.9817157 |
| Minimum | 0.35 |
|---|---|
| Maximum | 4331.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 0.35 |
|---|---|
| 5-th percentile | 11.025 |
| Q1 | 36.42 |
| median | 67 |
| Q3 | 106.74 |
| 95-th percentile | 234.53 |
| Maximum | 4331.4 |
| Range | 4331.05 |
| Interquartile range (IQR) | 70.32 |
Descriptive statistics
| Standard deviation | 351.8314963 |
|---|---|
| Coefficient of variation (CV) | 3.033508293 |
| Kurtosis | 122.3391223 |
| Mean | 115.9817157 |
| Median Absolute Deviation (MAD) | 34.34 |
| Skewness | 10.72466346 |
| Sum | 217001.79 |
| Variance | 123785.4018 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.5 | 14 | < 0.1% |
| 4331.4 | 11 | < 0.1% |
| 57.33 | 10 | < 0.1% |
| 52.85 | 10 | < 0.1% |
| 29.5 | 9 | < 0.1% |
| 70.5 | 8 | < 0.1% |
| 12 | 7 | < 0.1% |
| 10 | 6 | < 0.1% |
| 40 | 5 | < 0.1% |
| 29.13 | 5 | < 0.1% |
| Other values (1554) | 1786 | 0.1% |
| (Missing) | 3327276 |
| Value | Count | Frequency (%) |
| 0.35 | 1 | < 0.1% |
| 0.4 | 1 | < 0.1% |
| 0.8 | 1 | < 0.1% |
| 0.89 | 1 | < 0.1% |
| 0.92 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 1.25 | 2 | |
| 1.48 | 1 | < 0.1% |
| 1.8 | 2 | |
| 2 | 4 |
| Value | Count | Frequency (%) |
| 4331.4 | 11 | |
| 2687.2 | 2 | < 0.1% |
| 1750.5 | 1 | < 0.1% |
| 1681.9 | 1 | < 0.1% |
| 1348.01 | 1 | < 0.1% |
| 1251.44 | 1 | < 0.1% |
| 927.7 | 1 | < 0.1% |
| 894.64 | 1 | < 0.1% |
| 881.18 | 1 | < 0.1% |
| 776.72 | 1 | < 0.1% |
lot5_surface_carrez
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 647 |
|---|---|
| Distinct (%) | 84.4% |
| Missing | 3328380 |
| Missing (%) | > 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 121.0577705 |
| Minimum | 0.6 |
|---|---|
| Maximum | 8188 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 0.6 |
|---|---|
| 5-th percentile | 10.365 |
| Q1 | 29.615 |
| median | 62.14 |
| Q3 | 113.645 |
| 95-th percentile | 302.286 |
| Maximum | 8188 |
| Range | 8187.4 |
| Interquartile range (IQR) | 84.03 |
Descriptive statistics
| Standard deviation | 399.6793349 |
|---|---|
| Coefficient of variation (CV) | 3.301558695 |
| Kurtosis | 244.9651218 |
| Mean | 121.0577705 |
| Median Absolute Deviation (MAD) | 38.94 |
| Skewness | 14.16510403 |
| Sum | 92851.31 |
| Variance | 159743.5707 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 49.37 | 12 | < 0.1% |
| 117.9 | 10 | < 0.1% |
| 29.5 | 10 | < 0.1% |
| 90.19 | 8 | < 0.1% |
| 16.71 | 7 | < 0.1% |
| 12.5 | 5 | < 0.1% |
| 54.94 | 5 | < 0.1% |
| 11 | 4 | < 0.1% |
| 12 | 4 | < 0.1% |
| 66 | 4 | < 0.1% |
| Other values (637) | 698 | < 0.1% |
| (Missing) | 3328380 |
| Value | Count | Frequency (%) |
| 0.6 | 1 | |
| 0.93 | 1 | |
| 1 | 2 | |
| 1.33 | 1 | |
| 1.89 | 1 | |
| 2.75 | 1 | |
| 3 | 1 | |
| 3.2 | 1 | |
| 3.4 | 1 | |
| 3.52 | 1 |
| Value | Count | Frequency (%) |
| 8188 | 1 | |
| 4331.4 | 1 | |
| 3418.58 | 1 | |
| 2837.22 | 1 | |
| 2683.5 | 2 | |
| 997.88 | 1 | |
| 924 | 1 | |
| 625.95 | 1 | |
| 614.69 | 1 | |
| 591 | 1 |
| Distinct | 82 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3985618538 |
| Minimum | 0 |
|---|---|
| Maximum | 330 |
| Zeros | 2294729 |
| Zeros (%) | 68.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 330 |
| Range | 330 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8362805707 |
|---|---|
| Coefficient of variation (CV) | 2.098245386 |
| Kurtosis | 11491.95161 |
| Mean | 0.3985618538 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 46.98893099 |
| Sum | 1326871 |
| Variance | 0.6993651929 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2294729 | |
| 1 | 817196 | 24.5% |
| 2 | 180719 | 5.4% |
| 3 | 23780 | 0.7% |
| 4 | 6711 | 0.2% |
| 5 | 2346 | 0.1% |
| 6 | 1259 | < 0.1% |
| 7 | 668 | < 0.1% |
| 8 | 438 | < 0.1% |
| 9 | 284 | < 0.1% |
| Other values (72) | 1017 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2294729 | |
| 1 | 817196 | 24.5% |
| 2 | 180719 | 5.4% |
| 3 | 23780 | 0.7% |
| 4 | 6711 | 0.2% |
| 5 | 2346 | 0.1% |
| 6 | 1259 | < 0.1% |
| 7 | 668 | < 0.1% |
| 8 | 438 | < 0.1% |
| 9 | 284 | < 0.1% |
| Value | Count | Frequency (%) |
| 330 | 1 | |
| 223 | 1 | |
| 198 | 1 | |
| 136 | 1 | |
| 131 | 1 | |
| 121 | 1 | |
| 120 | 1 | |
| 119 | 1 | |
| 116 | 1 | |
| 112 | 1 |
code_type_local
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1512780 |
| Missing (%) | 45.4% |
| Memory size | 25.4 MiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 3.0 | |
| 4.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 648413 | |
| 2.0 | 585969 | 17.6% |
| 3.0 | 450430 | 13.5% |
| 4.0 | 131555 | 4.0% |
| (Missing) | 1512780 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1.0 | 648413 | |
| 2.0 | 585969 | |
| 3.0 | 450430 | |
| 4.0 | 131555 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1512780 |
| Missing (%) | 45.4% |
| Memory size | 25.4 MiB |
| Maison | |
|---|---|
| Appartement | |
| Dépendance | |
| Local industriel. commercial ou assimilé |
Length
| Max length | 40 |
|---|---|
| Median length | 10 |
| Mean length | 11.06749737 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Appartement |
|---|---|
| 2nd row | Dépendance |
| 3rd row | Maison |
| 4th row | Maison |
| 5th row | Maison |
Common Values
| Value | Count | Frequency (%) |
| Maison | 648413 | |
| Appartement | 585969 | 17.6% |
| Dépendance | 450430 | 13.5% |
| Local industriel. commercial ou assimilé | 131555 | 4.0% |
| (Missing) | 1512780 |
Length
Pie chart
| Value | Count | Frequency (%) |
| maison | 648413 | |
| appartement | 585969 | |
| dépendance | 450430 | |
| assimilé | 131555 | 5.6% |
| ou | 131555 | 5.6% |
| commercial | 131555 | 5.6% |
| industriel | 131555 | 5.6% |
| local | 131555 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
surface_reelle_bati
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWED| Distinct | 4602 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1970649 |
| Missing (%) | 59.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 115.3422611 |
| Minimum | 1 |
|---|---|
| Maximum | 277814 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 49 |
| median | 75 |
| Q3 | 104 |
| 95-th percentile | 190 |
| Maximum | 277814 |
| Range | 277813 |
| Interquartile range (IQR) | 55 |
Descriptive statistics
| Standard deviation | 806.5258823 |
|---|---|
| Coefficient of variation (CV) | 6.992457706 |
| Kurtosis | 32505.40326 |
| Mean | 115.3422611 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 134.4592681 |
| Sum | 156692231 |
| Variance | 650483.9989 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 26530 | 0.8% |
| 60 | 25286 | 0.8% |
| 90 | 23641 | 0.7% |
| 70 | 23502 | 0.7% |
| 100 | 20189 | 0.6% |
| 50 | 20006 | 0.6% |
| 40 | 19018 | 0.6% |
| 65 | 18973 | 0.6% |
| 75 | 16445 | 0.5% |
| 45 | 16022 | 0.5% |
| Other values (4592) | 1148886 | |
| (Missing) | 1970649 |
| Value | Count | Frequency (%) |
| 1 | 363 | < 0.1% |
| 2 | 311 | < 0.1% |
| 3 | 372 | < 0.1% |
| 4 | 195 | < 0.1% |
| 5 | 307 | < 0.1% |
| 6 | 363 | < 0.1% |
| 7 | 552 | < 0.1% |
| 8 | 933 | < 0.1% |
| 9 | 1172 | < 0.1% |
| 10 | 3439 |
| Value | Count | Frequency (%) |
| 277814 | 2 | |
| 215290 | 1 | |
| 207134 | 1 | |
| 134000 | 1 | |
| 132049 | 1 | |
| 123000 | 1 | |
| 112896 | 1 | |
| 106683 | 1 | |
| 100996 | 1 | |
| 99850 | 1 |
nombre_pieces_principales
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 46 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1515599 |
| Missing (%) | 45.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.353666956 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 581298 |
| Zeros (%) | 17.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.082816505 |
|---|---|
| Coefficient of variation (CV) | 0.8849240545 |
| Kurtosis | 3.945167896 |
| Mean | 2.353666956 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.653470421 |
| Sum | 4268488 |
| Variance | 4.338124595 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 581298 | 17.5% |
| 4 | 300776 | 9.0% |
| 3 | 299373 | 9.0% |
| 2 | 220792 | 6.6% |
| 5 | 174948 | 5.3% |
| 1 | 127847 | 3.8% |
| 6 | 68048 | 2.0% |
| 7 | 24857 | 0.7% |
| 8 | 8814 | 0.3% |
| 9 | 3389 | 0.1% |
| Other values (36) | 3406 | 0.1% |
| (Missing) | 1515599 |
| Value | Count | Frequency (%) |
| 0 | 581298 | |
| 1 | 127847 | 3.8% |
| 2 | 220792 | 6.6% |
| 3 | 299373 | |
| 4 | 300776 | |
| 5 | 174948 | 5.3% |
| 6 | 68048 | 2.0% |
| 7 | 24857 | 0.7% |
| 8 | 8814 | 0.3% |
| 9 | 3389 | 0.1% |
| Value | Count | Frequency (%) |
| 90 | 1 | |
| 63 | 1 | |
| 55 | 1 | |
| 52 | 1 | |
| 50 | 1 | |
| 48 | 1 | |
| 41 | 1 | |
| 40 | 1 | |
| 39 | 1 | |
| 38 | 1 |
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1050173 |
| Missing (%) | 31.5% |
| Memory size | 25.4 MiB |
| S | |
|---|---|
| T | |
| P | |
| AB | |
| J | |
| Other values (22) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.204297636 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S |
|---|---|
| 2nd row | AG |
| 3rd row | AG |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 1052935 | |
| T | 349321 | 10.5% |
| P | 180010 | 5.4% |
| AB | 131207 | 3.9% |
| J | 115605 | 3.5% |
| BT | 97939 | 2.9% |
| L | 93441 | 2.8% |
| AG | 78818 | 2.4% |
| VI | 42017 | 1.3% |
| BR | 34449 | 1.0% |
| Other values (17) | 103232 | 3.1% |
| (Missing) | 1050173 |
Length
| Value | Count | Frequency (%) |
| s | 1052935 | |
| t | 349321 | 15.3% |
| p | 180010 | 7.9% |
| ab | 131207 | 5.8% |
| j | 115605 | 5.1% |
| bt | 97939 | 4.3% |
| l | 93441 | 4.1% |
| ag | 78818 | 3.5% |
| vi | 42017 | 1.8% |
| br | 34449 | 1.5% |
| Other values (17) | 103232 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1050173 |
| Missing (%) | 31.5% |
| Memory size | 25.4 MiB |
| sols | |
|---|---|
| terres | |
| prés | |
| terrains a bâtir | |
| jardins | |
| Other values (22) |
Length
| Max length | 19 |
|---|---|
| Median length | 4 |
| Mean length | 6.717378083 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | sols |
|---|---|
| 2nd row | terrains d'agrément |
| 3rd row | terrains d'agrément |
| 4th row | sols |
| 5th row | sols |
Common Values
| Value | Count | Frequency (%) |
| sols | 1052935 | |
| terres | 349321 | 10.5% |
| prés | 180010 | 5.4% |
| terrains a bâtir | 131207 | 3.9% |
| jardins | 115605 | 3.5% |
| taillis simples | 97939 | 2.9% |
| landes | 93441 | 2.8% |
| terrains d'agrément | 78818 | 2.4% |
| vignes | 42017 | 1.3% |
| futaies résineuses | 34449 | 1.0% |
| Other values (17) | 103232 | 3.1% |
| (Missing) | 1050173 |
Length
| Value | Count | Frequency (%) |
| sols | 1052935 | |
| terres | 349428 | 12.5% |
| terrains | 210025 | 7.5% |
| prés | 182504 | 6.5% |
| a | 131207 | 4.7% |
| bâtir | 131207 | 4.7% |
| jardins | 115605 | 4.1% |
| taillis | 114762 | 4.1% |
| simples | 97939 | 3.5% |
| landes | 93800 | 3.4% |
| Other values (24) | 318731 | 11.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 125 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3175092 |
| Missing (%) | 95.4% |
| Memory size | 25.4 MiB |
| POTAG | |
|---|---|
| PATUR | |
| PIN | |
| PARC | |
| FRICH | |
| Other values (120) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.483165103 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | JARD |
|---|---|
| 2nd row | JARD |
| 3rd row | ETANG |
| 4th row | PARC |
| 5th row | PATUR |
Common Values
| Value | Count | Frequency (%) |
| POTAG | 32746 | 1.0% |
| PATUR | 16277 | 0.5% |
| PIN | 12625 | 0.4% |
| PARC | 12578 | 0.4% |
| FRICH | 9834 | 0.3% |
| VAOC | 7598 | 0.2% |
| IMM | 5027 | 0.2% |
| CHAT | 4875 | 0.1% |
| PACAG | 3476 | 0.1% |
| MARAI | 3451 | 0.1% |
| Other values (115) | 45568 | 1.4% |
| (Missing) | 3175092 |
Length
| Value | Count | Frequency (%) |
| potag | 32746 | |
| patur | 16277 | 10.6% |
| pin | 12625 | 8.2% |
| parc | 12578 | 8.2% |
| frich | 9834 | 6.4% |
| vaoc | 7598 | 4.9% |
| imm | 5027 | 3.3% |
| chat | 4875 | 3.2% |
| pacag | 3476 | 2.3% |
| marai | 3451 | 2.2% |
| Other values (115) | 45568 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 125 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3175092 |
| Missing (%) | 95.4% |
| Memory size | 25.4 MiB |
| Jardin potager | |
|---|---|
| Pâture plantée | |
| Pins | |
| Parc | |
| Friche | |
| Other values (120) |
Length
| Max length | 38 |
|---|---|
| Median length | 14 |
| Mean length | 12.7497582 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Jardin d'agrément |
|---|---|
| 2nd row | Jardin d'agrément |
| 3rd row | Etangs |
| 4th row | Parc |
| 5th row | Pâture plantée |
Common Values
| Value | Count | Frequency (%) |
| Jardin potager | 32746 | 1.0% |
| Pâture plantée | 16277 | 0.5% |
| Pins | 12625 | 0.4% |
| Parc | 12578 | 0.4% |
| Friche | 9834 | 0.3% |
| Vins d'appellation d'origine contrôlée | 7598 | 0.2% |
| Dépendances d'ensemble immobilier | 5027 | 0.2% |
| Châtaigneraie | 4875 | 0.1% |
| Pacage | 3476 | 0.1% |
| Pré marais | 3451 | 0.1% |
| Other values (115) | 45568 | 1.4% |
| (Missing) | 3175092 |
Length
| Value | Count | Frequency (%) |
| jardin | 34246 | 12.1% |
| potager | 32746 | 11.6% |
| pâture | 16277 | 5.8% |
| plantée | 16277 | 5.8% |
| pins | 12625 | 4.5% |
| parc | 12584 | 4.5% |
| friche | 9834 | 3.5% |
| ou | 8488 | 3.0% |
| vins | 7818 | 2.8% |
| d'origine | 7598 | 2.7% |
| Other values (159) | 123872 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 46290 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 1050235 |
| Missing (%) | 31.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3113.500546 |
| Minimum | 1 |
|---|---|
| Maximum | 4625500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 236 |
| median | 627 |
| Q3 | 1933 |
| 95-th percentile | 12491.45 |
| Maximum | 4625500 |
| Range | 4625499 |
| Interquartile range (IQR) | 1697 |
Descriptive statistics
| Standard deviation | 14470.70868 |
|---|---|
| Coefficient of variation (CV) | 4.647729612 |
| Kurtosis | 17242.01383 |
| Mean | 3113.500546 |
| Median Absolute Deviation (MAD) | 503 |
| Skewness | 82.35419667 |
| Sum | 7095393757 |
| Variance | 209401409.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 43063 | 1.3% |
| 1000 | 20255 | 0.6% |
| 600 | 6636 | 0.2% |
| 800 | 6628 | 0.2% |
| 12 | 5790 | 0.2% |
| 400 | 5397 | 0.2% |
| 700 | 5340 | 0.2% |
| 13 | 5319 | 0.2% |
| 100 | 5150 | 0.2% |
| 300 | 5143 | 0.2% |
| Other values (46280) | 2170191 | |
| (Missing) | 1050235 |
| Value | Count | Frequency (%) |
| 1 | 4712 | |
| 2 | 4071 | |
| 3 | 3446 | |
| 4 | 3776 | |
| 5 | 3895 | |
| 6 | 3726 | |
| 7 | 3656 | |
| 8 | 3609 | |
| 9 | 3334 | |
| 10 | 4530 |
| Value | Count | Frequency (%) |
| 4625500 | 1 | < 0.1% |
| 3989055 | 1 | < 0.1% |
| 3805880 | 1 | < 0.1% |
| 3591800 | 1 | < 0.1% |
| 3032771 | 1 | < 0.1% |
| 2960000 | 3 | |
| 2889300 | 1 | < 0.1% |
| 2745500 | 1 | < 0.1% |
| 2636576 | 1 | < 0.1% |
| 2585700 | 1 | < 0.1% |
| Distinct | 1790095 |
|---|---|
| Distinct (%) | 55.0% |
| Missing | 72660 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.234953837 |
| Minimum | -63.146385 |
|---|---|
| Maximum | 55.826361 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 751121 |
| Negative (%) | 22.6% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | -63.146385 |
|---|---|
| 5-th percentile | -2.2577305 |
| Q1 | 0.205196 |
| median | 2.336341 |
| Q3 | 4.446119 |
| 95-th percentile | 6.5915134 |
| Maximum | 55.826361 |
| Range | 118.972746 |
| Interquartile range (IQR) | 4.240923 |
Descriptive statistics
| Standard deviation | 6.365984938 |
|---|---|
| Coefficient of variation (CV) | 2.84837424 |
| Kurtosis | 66.46723583 |
| Mean | 2.234953837 |
| Median Absolute Deviation (MAD) | 2.124593 |
| Skewness | -1.624439272 |
| Sum | 7278098.116 |
| Variance | 40.52576423 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.513143 | 1158 | < 0.1% |
| 2.317785 | 1052 | < 0.1% |
| 2.455177 | 1004 | < 0.1% |
| 2.232054 | 834 | < 0.1% |
| 2.201739 | 764 | < 0.1% |
| -1.213741 | 714 | < 0.1% |
| 2.48249 | 677 | < 0.1% |
| 4.410593 | 652 | < 0.1% |
| -1.213963 | 613 | < 0.1% |
| 1.713259 | 598 | < 0.1% |
| Other values (1790085) | 3248421 | |
| (Missing) | 72660 | 2.2% |
| Value | Count | Frequency (%) |
| -63.146385 | 6 | |
| -63.145312 | 3 | < 0.1% |
| -63.145286 | 8 | |
| -63.144002 | 3 | < 0.1% |
| -63.143845 | 2 | < 0.1% |
| -63.140141 | 4 | |
| -63.140095 | 2 | < 0.1% |
| -63.138125 | 1 | < 0.1% |
| -63.134788 | 1 | < 0.1% |
| -63.131714 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 55.826361 | 2 | |
| 55.825837 | 1 | |
| 55.825179 | 1 | |
| 55.824285 | 1 | |
| 55.824192 | 1 | |
| 55.823385 | 1 | |
| 55.823287 | 1 | |
| 55.822873 | 1 | |
| 55.822608 | 1 | |
| 55.821368 | 2 |
| Distinct | 1731438 |
|---|---|
| Distinct (%) | 53.2% |
| Missing | 72660 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.13576713 |
| Minimum | -21.386772 |
|---|---|
| Maximum | 51.082118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 16431 |
| Negative (%) | 0.5% |
| Memory size | 25.4 MiB |
Quantile statistics
| Minimum | -21.386772 |
|---|---|
| 5-th percentile | 43.2150312 |
| Q1 | 44.7221355 |
| median | 46.725688 |
| Q3 | 48.6938485 |
| 95-th percentile | 49.884219 |
| Maximum | 51.082118 |
| Range | 72.46889 |
| Interquartile range (IQR) | 3.971713 |
Descriptive statistics
| Standard deviation | 5.771072897 |
|---|---|
| Coefficient of variation (CV) | 0.1250889116 |
| Kurtosis | 95.67778275 |
| Mean | 46.13576713 |
| Median Absolute Deviation (MAD) | 1.976642 |
| Skewness | -8.896969398 |
| Sum | 150240525.9 |
| Variance | 33.30528238 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 43.670749 | 1159 | < 0.1% |
| 48.63569 | 1051 | < 0.1% |
| 49.035396 | 1004 | < 0.1% |
| 48.945538 | 836 | < 0.1% |
| 48.944566 | 764 | < 0.1% |
| 44.839384 | 715 | < 0.1% |
| 48.903144 | 677 | < 0.1% |
| 45.417475 | 652 | < 0.1% |
| 44.848046 | 613 | < 0.1% |
| 48.974742 | 600 | < 0.1% |
| Other values (1731428) | 3248416 | |
| (Missing) | 72660 | 2.2% |
| Value | Count | Frequency (%) |
| -21.386772 | 5 | |
| -21.385952 | 1 | < 0.1% |
| -21.385389 | 2 | < 0.1% |
| -21.384999 | 4 | |
| -21.384806 | 1 | < 0.1% |
| -21.384644 | 7 | |
| -21.384108 | 1 | < 0.1% |
| -21.383799 | 2 | < 0.1% |
| -21.383615 | 1 | < 0.1% |
| -21.38356 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 51.082118 | 6 | |
| 51.082045 | 2 | < 0.1% |
| 51.081947 | 5 | |
| 51.081765 | 6 | |
| 51.08171 | 2 | < 0.1% |
| 51.081631 | 3 | < 0.1% |
| 51.081576 | 8 | |
| 51.081375 | 1 | < 0.1% |
| 51.080942 | 2 | < 0.1% |
| 51.080765 | 2 | < 0.1% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.